Non-negative matrix factorization and its applications to audio signal processing
نویسنده
چکیده
In this paper, I will give a brief introduction to a data analysis technique called non-negative matrix factorization (NMF), which has attracted a lot of attention in the field of audio signal processing in recent years. I will mention some basic properties of NMF, effects induced by the non-negative constraints, how to derive an iterative algorithm for NMF, and some attempts that have been made to apply NMF to audio processing problems.
منابع مشابه
Voice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملNon-negative Matrix Factorization and Its Variants for Audio Signal Processing
In this chapter, I briefly introduce a multivariate analysis technique called non-negative matrix factorization (NMF), which has attracted a lot of attention in the field of audio signal processing in recent years. I will mention some basic properties of NMF, effects induced by the non-negative constraints, how to derive an iterative algorithm for NMF, and some attempts that have been made to a...
متن کاملNon-negative Matrix Factorization, A New Tool for Feature Extraction: Theory and Applications
Despite its relative novelty, non-negative matrix factorization (NMF) method knew a huge interest from the scientific community, due to its simplicity and intuitive decomposition. Plenty of applications benefited from it, including image processing (face, medical, etc.), audio data processing or text mining and decomposition. This paper briefly describes the underlaying mathematical NMF theory ...
متن کاملLp-norm non-negative matrix factorization and its application to singing voice enhancement
Measures of sparsity are useful in many aspects of audio signal processing including speech enhancement, audio coding and singing voice enhancement, and the well-known method for these applications is non-negative matrix factorization (NMF), which decomposes a non-negative data matrix into two non-negative matrices. Although previous studies on NMF have focused on the sparsity of the two matric...
متن کاملItakura-Saito Nonnegative Factorizations of the Power Spectrogram for Music Signal Decomposition
Nonnegative matrix factorization (NMF) is a popular linear regression technique in the fields of machine learning and signal/image processing. Much research about this topic has been driven by applications in audio. NMF has been for example applied with success to automatic music transcription and audio source separation, where the data is usually taken as the magnitude spectrogram of the sound...
متن کامل